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We show that the Glauber dynamics on proper 9-colourings of the tri- 
angular lattice is rapidly mixing, which allows for efficient sampling. Con- 
sequently, there is a fully polynomial randomised approximation scheme 
(FPRAS) for counting proper 9-colourings of the triangular lattice. Proper 
colourings correspond to configurations in the zero-temperature anti-ferro- 
magnetic Potts model. We show that the spin system consisting of proper 
9-colourings of the triangular lattice has strong spatial mixing. This im- 
plies that there is a unique infinite-volume Gibbs distribution, which is an 
important property studied in statistical physics. Our results build on pre- 
vious work by Goldberg, Martin and Paterson, who showed similar results 
for 10 colours on the triangular lattice. Their work was preceded by Salas 
and Sokal's 11-colour result. Both proofs rely on computational assistance, 
and so does our 9-colour proof. We have used a randomised heuristic to 
guide us towards rigourous results. 



1 Introduction 

This paper is concerned with proper 9-colourings of the triangular lattice. A 
q-colouring of a graph G is an assignment of colours from the set {1, ...,(?} 
to the vertices. A colouring is proper if adjacent vertices receive different 
colours. There are two fundamental problems that have been studied ex- 
tensively: sampling g-colourings from the uniform distribution on proper 
g-colourings of G and counting the number of proper g-colourings of G. 
Both these problems have been studied for various graphs and number of 
colours, and it has been shown that they are intimately related. More pre- 
cisely, if there is an efficient method of finding a good approximate solution 
to one of the problems, then there is an efficient method of finding a good 
approximate solution to the other problem. The problem of counting q- 
colourings of a graph with maximum degree A is #P-complete when q ^ 3 



and A ^ 3 (see [3]). Hence we have to rely on approximate solutions. A 
great deal of work has been put into mapping out for which graphs and num- 
ber of colours there is a fully polynomial randomised approximation scheme 
(FPRAS) for counting colourings, or equivalently, a fully polynomial almost 
uniform sampler (FPAUS). 

Sampling colourings is closely related to problems arising in statisti- 
cal physics, where proper colourings correspond to configurations in the 
zero-temperature anti-ferromagnetic Potts model. The notion of a proper 
colouring imposes a very local property: it constraints the allowed colours 
on two adjacent vertices. Given local constraints, physicists are interested 
in understanding the macroscopic properties of the system. 

Let C be an infinite graph. Think of C as lattice graph, for example 
the grid. For a finite subgraph G of C, the boundary of G is the set of 
vertices in C that are adjacent to G but are not in G themselves. Given a 
colouring B of the boundary, a proper g-colouring of G agrees with B if no 
vertex adjacent to the boundary receives any of the colours of its boundary 
neighbours. Let 7r# denote the uniform distribution on proper q-colourings 
of G that agree with B. Suppose ire is a distribution on the set of proper 
g-colourings of the infinite graph C. For a proper q-colouring C of £., let 
irc( • | C(£ \ G)) denote the conditional distribution on proper q-colouring 
of G induced by ire when the colours of all vertices except for those in G 
are specified by C. The distribution ttc is an infinite-volume Gibbs distribu- 
tion (with respect to proper colourings) if, for any proper (/-colouring C of C, 
irc( • | C(C\G)) = irjg, where B is the colouring of the boundary of G induced 
by C. That is, the conditional distribution on colourings of G depends only 
on the boundary of G and not on other vertices of C The physical intuition 
for an infinite-volume Gibbs distribution is that it describes a macroscopic 
equilibrium, for which all parts of the system are in equilibrium with their 
boundaries. It is known that there always exists at least one infinite- volume 
Gibbs distribution, and a central question in statistical physics is to deter- 
mine whether it is unique or not. The phenomenon of non- uniqueness is 
referred to as a phase transition. For more on Gibbs distributions, see, for 
example, [6] or [7]. 

Given a finite subgraph G of C and a boundary colouring B of G, let G' 
be any connected subset of the vertices of G and let 8 be the distribution on 
colourings of G' that is induced by 7rg. The question we ask is how does 8 
change if we change the colour of one single vertex on the boundary of G. If 
G' is far away from the boundary then we would expect that changing the 
colour of a boundary vertex does not have too much influence on 8. If this 
is true, then we have a property known as strong spatial mixing. The exact 
definition will be given later. A consequence of strong spatial mixing is that 
the infinite- volume Gibbs distribution is unique [2TJ [22] . In this paper 
we show that there is strong spatial mixing for 9 colours on the triangular 
lattice. 
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Another important problem in statistical physics is to determine how 
quickly a system converges to equilibrium. This could lead to insights in 
how the system returns to equilibrium after a shock has disturbed it. A 
dynamical process that is commonly studied is the Glauber dynamics. The 
Glauber dynamics is a Markov chain on the set of proper g-colourings of 
a graph G. A transition from one state (colouring) to another is made 
as follows. Choose a vertex v S G uniformly at random and choose one 
of the q colours uniformly at random. If the randomly selected colour is 
different from the colours of the neighbours of v then recolour v with the 
selected colour. This procedure is know as a single-vertex heat-bath update. 
If it is repeated over and over then the distribution of the current state 
of the Markov chain will converge towards the uniform distribution. The 
question is how long to run the dynamics for in order to get close to the 
uniform distribution. If we only need to simulate the dynamics over a small 
number of steps then the Glauber dynamics would be a suitable tool for 
sampling colourings. We say that a Markov chain is rapidly mixing if a 
small number of steps are sufficient in order to get arbitrarily close to its 
stationary distribution. Exact definitions will be given later. It is a well- 
known fact that if the system has strong spatial mixing, then the Glauber 
dynamics is (often) rapidly mixing [SJIUJIEI]. I n fact, the converse is also 
true. Since we prove strong spatial mixing for 9 colours on the triangular 
lattice, we also prove that the Glauber dynamics is rapidly mixing. That is, 
we show that there is an FPAUS for sampling 9-colourings of the triangular 
lattice. As mentioned above, the FPAUS implies that there is an FPRAS 
for counting 9-colourings. 

1.1 Organisation 

The remainder of the paper is organised as follows. In Section [2j we de- 
fine the basic notation, strong spatial mixing and the Glauber dynamics. 
In Section [3j we formally state our results and discuss related work. In 
Section HI we introduce two key concepts that will be used throughout the 
paper, and Sections [5] through [6] contain the technical part that leads to the 
strong spatial mixing result. Section [7] is about the implication from strong 
spatial mixing to a rapidly mixing Glauber dynamics. Finally, in Section [HI 
we describe the computer assisted parts of our proofs. 

2 Preliminaries 

2.1 Basic notation 

Let the infinite graph T = (Vjr, Ej-) denote the triangular lattice, formally 
defined as follows. There is a bijection £ from Vqr to {(x, y) \ x,y G Z and 
x + y is even} such that £,~ l {x, y) S Vjr has the six neighbours y + 2), 
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r l {x + l,y + 1), + 1,1/ - 1), C\x,y - 2), r 1 ^ - 1,1/ - 1) and 

maps a vertex of V7- to a coordinate in a 
Cartesian coordinate system, which allows us to draw the triangular lattice 
as illustrated in Figure [TJ However, throughout this article we will draw 
vertices (IS (I hexagons, illustrated in Figure [2j 

The bijection £ specifies a unique clockwise ordering of edges incident 
to the same vertex. We write e% ~< ■ ■ ■ ~< e& to indicate that e% — > e.% — > 
• • • — > efc — >• ei — >■ • • ■ is the clockwise ordering of the edges ei, . . . , around 
some vertex u. We say that two edges e\ and ei incident to the same 
vertex v are clockwise adjacent if either e\ -< ei ■< e^ -< e^ -< e$ -< e$ or 
^2 ~< ei ~< £3 -< 64 -< es -< eg, where e^, . . . , eg are the other four edges in 
E7- incident to v. 

A region R is a finite subset of V7- and its edge boundary, denoted 6R = 
{{v,w} I {v,w} £ Ej-,v £ -R, u> ^ -R}. The vertex boundary of i?, denoted 
= | {v, e£R,w<£ R}. 

For a region R, v £ R and £ clR, we let dn(w, v) denote the number 
of edges on a shortest path P in T between w and v such that all vertices 
in P except for w are in R. Thus, dn(w, v) = 1 if v and w are adjacent. For 
a subregion i?' C i?, we let dn(w,R') = min ug R/ dn(w,u). 

Let [q] = {l,...,q}. A q-colouring C of a set S (a set of vertices or 
edges), is a function from 5 to [q], where [q] represents a set of q colours. A 
partial q-colouring of 5 is a function from S to [q]U{0}, where the additional 
colour can be thought of as representing "no colour." 

A q-colouring C of a region R is proper if for all u,v £ R, C(u) 7^ C(v) 
when {u, v} £ E-r. For a partial q-colouring B of the boundary dR, a 
q-colouring C of R agrees with if for all {w,v} £ B(w) 7^ C(v). 
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Similarly, for a partial g-colouring B of £R, a (/-colouring C of R agrees 
with B if for all {w,v} € £R, B({w,v}) ^ C(v). To aid the reader, we 
use the calligraphic font to represent colourings of vertices and the normal 
font to represent colourings of edges (like B and B above). Further, partial 
colourings (containing the "colour" 0) are used only for boundary vertices 
and boundary edges and not for the regions themselves. 

For a region R and partial (/-colouring B of dR (respectively, B of £R), 
fig (respectively, fi B ) denotes the set of proper (/-colourings of R that agree 
with B (respectively, B). The uniform distribution on Q q B (respectively, Q q B ) 
is denoted tt b (respectively, tt b ). For a subregion R' C R, we let ir B (R') 
denote the distribution on proper (/-colourings of R' induced by 7r|>. 

For a distribution on a set S, we write Pie(x) for the probability of 
drawing x £ S under 0. The total variation distance between two distribu- 
tions 0i and 02 on a set S is 

dtv(ei,9 2 ) = l Pr ^ x ) " Pr ^( x )l = |Pr ei (5') - Vre 2 {S')\ • 

xes 

For two distributions 0\ and 02 on two sets S\ and S 2 , respectively, a 
coupling of #i and 02 is a joint distribution on S\ x S2 that has 0\ and #2 
as its marginal distributions. 

We write E[X] to denote the expected value of a random variable X. 

2.2 Strong spatial mixing 

We say that there is strong spatial mixing for q colours on the triangular 
lattice if there are two constants (3,(3' > (that may depend on q) such that 

d u {K q B {R'),ir B ,{R')) ^ (3\R'\e^' d ^ w ^ (1) 

is true for all regions R, subregions R' C i?, w £ 9i? and partial (/-colourings 
B and B' of 9i? such that B(v) = B'(v) for all v £ dR\ {w}, B{w) + B'(w), 
B(w) > and B'(w) > 0. 

2.3 The Glauber dynamics 

The Glauber dynamics for (/-colourings of a region R with partial (/-colouring 
B of dR is a Markov chain with state space Q B and the following transitions. 
The evolution from a colouring Ct £ fig to a new colouring Ct+i £ fig is 
defined by the following steps. 

1. Choose a vertex v from R uniformly at random. 

Let C be the colouring of R U dR specified by Ct and B. 

Let C n cighbours = {C(u) | {u, w} G £V}- 

2. Choose a colour c from [g] \ C ne i g hbours uniformly at random. 
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3. Let 

if u = v; 



C t +i(u) 




otherwise. 



The probability of not leaving a state in a transition is always positive. 
Hence the Glauber dynamics is an ergodic Markov chain if the number of 
colours q is sufficiently large; every state can be reached from any other 
state. In this article we are concerned with q = 9 colours for which we 
note that there are always at least three available colours in Step 2 of the 
dynamics above. Hence the Glauber dynamics is ergodic for q = 9. 

It is straightforward to verify that when the Glauber dynamics is ergodic 
then its stationary distribution is 7r|>. Let Oq be the distribution on fig after 
t steps of the Glauber dynamics, starting with colouring Co. For 5 > 0, the 
mixing time 

Tq{5) = max minjt | dtv(#c > ^ ^} 



is the number of transitions until the dynamics is within total variation 
distance 5 of the stationary distribution, assuming the worst initial colouring 
Co- We say that the Glauber dynamics is rapidly mixing if t%{5) is upper- 
bounded by a polynomial in \R\ and log(l/<5). 



2.4 Approximate counting 

A randomised approximation scheme (RAS) for a function / : X* — > N is a 
probabilistic Turing machine that takes as input a pair (x,e) € X* x (0, 1), 
and produces, on an output tape, an integer random variable Y satisfying 
the condition Pr(e _e ^ Y/f(x) ^ e e ) ^ |. The choice of the value | is 
inconsequential: the same class of problems has a RAS if we choose any 
probability in the interval (5, 1) (see for example [IS]). A fully polynomial 
randomised approximation scheme (FPRAS) is a RAS that runs in time 
upper-bounded by a polynomial in \x\ and e^ 1 . 

It was mentioned in the introduction that the existence of an efficient 
method for sampling colourings implies that there is an FPRAS for counting 
the number of colourings. We could use a rapidly mixing Glauber dynamics 
to construct (in a non-trivial way) an FPRAS for estimating |Ol|. For details 
on the topic of how sampling and counting are related, see, for example, [14| 
or [TS]. 

3 Our results and related work 

These are the two main theorems of the paper. 

Theorem 1 . There is strong spatial mixing for 9 colours on the triangular 
lattice. 
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Theorem 2. The Glauber dynamics on 9-colourings of a region R of the 
triangular lattice, with partial 9-colouring B of dR, has mixing time t^(5) G 
0(n 2 + re log 4), where n = \R\. 

The previously best known mixing results on the triangular lattice was 
given for 11 colours by Salas and Sokal [19] in 1997, and later improved 
by Goldberg, Martin and Paterson [S] to 10 colours in 2004. Both proofs 
involved computational assistance. 

To place these results in context, we first mention some general mixing 
bounds that are applicable to graphs with small girth, such as many of those 
lattices studied in statistical physics. Independently, Jerrum [13] and Salas 
and Sokal [19] proved that for proper (/-colourings on a graph of maximum 
degree A, the Glauber dynamics has 0{n log n) mixing time when q > 2A, 
where n is the number of vertices. For q = 2A, Bubley and Dyer [2] showed 
that it mixes in 0(ra 3 ) time, and later Molloy [T7] showed that it mixes 
in 0{n log n) time. In [2D], Vigoda used a Markov chain that differs from 
the Glauber dynamics and showed that it has 0(n log n) mixing time when 
q > (11/6) A. This result implies that also the Glauber dynamics is rapidly 
mixing for q > (11/6) A. Goldberg, Martin and Paterson [£] showed that any 
triangle free graph has strong spatial mixing provided q > oA — 7, where a 
is the solution to a a = e (a « 1.76322) and 7 = 4q3 ^gfer^ ~ 0.47031. 
For triangle free graphs of low degree, this is still today the best general 
mixing bound that has been proved. 

The technique Goldberg, Martin and Paterson used in [9] can be tailored 
and tweaked for particular graphs in order to give mixing bounds that are 
better than the general bound. This was demonstrated in [5] for the lattice 
Z 3 with q = 10 colours (the general result would give mixing for q = 11 
colours), and the triangular lattice with q = 10 colours. Although the 
general result holds only when the graph is triangle free, the tailored proof 
for the triangular lattice does not require this. Both proofs were computer 
assisted, where the computational part consisted of looping though a huge 
number of boundary colourings and maximising certain values. This task 
would have been impossible to do by hand. 

The general mixing bounds are tough barriers that seem difficult to 
break, though in many cases we expect mixing to occur with fewer colours. 
Therefore several proofs of mixing have been given for specific graphs or 
lattices. These proofs often involve computational assistance. We have 
mentioned two examples above as well as Salas and Sokal's 11-colour mix- 
ing bound on the triangular lattice [TI5]. Other examples of computer as- 
sisted proofs are those by Achlioptas, Molloy, Moore and van Bussel [I], 
who showed mixing for q = 6 colours on the grid, to which an alternative 
proof was given by Goldberg, Jalsenius, Martin and Paterson in [8]. Salas 
and Sokal gave in [19] a computer assisted proof for q = 6 colours on the 
kagome lattice, which was later improved to q = 5 colours by Jalsenius 



7 



in [TT]. It should also be mentioned that Jalsenius and Pedersen [12] have 
given a computer assisted proof of mixing with q = 7 colours for the grid 
when the dynamics is updating vertices in deterministic order, as opposed to 
the Glauber dynamics which chooses a vertex at random in each step. The 
new colour is still chosen at random, though. This result is an improvement 
of the non-computer assisted proofs by Pedersen |18J and Dyer, Goldberg 
and Jerrum 

The results on 9-colourings of the triangular lattice that we present in 
this paper are based on the 10-colour proof given by Goldberg, Martin and 
Paterson in [9]. Our 9-colour proof is of a whole different scale than the 10- 
colour proof and we must use computer assistance much more extensively 
and in more than one stage of the proof. The computations are rather 
demanding and prior to the final and rigourous results we had to use a 
heuristic to guide us in the right direction. We believe that the idea of 
such a heuristic could be useful to improve the mixing bounds for other 
lattices as well. However, our proof also demonstrates how demanding the 
computations can be, and unless new techniques are developed, there will 
probably be little progress in lowering the bounds for a vast number of 
lattices. 

4 Boundary pairs 

Similarly to Goldberg, Martin and Paterson in [9], we define two structures 
referred to as vertex-boundary pairs and edge-boundary pairs. Before stating 
the formal definitions, we give an overview of the two concepts. A vertex- 
boundary pair consists of a region and two partial colourings of its vertex 
boundary. The colourings are identical except for on one boundary vertex, 
which is not allowed to have the colour in either of the two colourings. 
That is, the vertex must have a "real" colour. An edge-boundary pair is 
similar to a vertex-boundary pair with the difference that the two boundary 
colourings are of the edge boundary instead of the vertex boundary. The 
formal definition of an edge-boundary pair might come across as slightly 
awkward as there are some additional conditions that must be met. The 
purpose of these conditions is to facilitate certain technicalities in the sub- 
sequent sections. 

Formally, a vertex-boundary pair X consists of 

• a region Rx, 

• a distinguished boundary vertex wx € dRx and 

• a pair [Bx , B'x ) of partial 9-colourings of dRx such that 

- Bxiy) = B'xiy) for all v G dRx \{wx}, Bx{wx) ^ B'x(wx) and 

- B x {wx) > and B' x (w x ) > 0. 
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An edge-boundary pair X consists of 



• a region Rx, 

• a distinguished boundary edge ex = {wx,vx} £ £Rx, where wx G 
dRx and vx G i?x are two distinguished vertices such that at most 
five neighbours of wx are in Rx, and 

• a pair (Bx,B'x) of partial 9-colourings of SRx such that 

- B x (e) = B' x (e) for all ee£R x \ {ex}, #x(ex) / B' x (e x ), 

- B x (e x ) > 0, B'x(ex) > 0, and 

- for any two clockwise adjacent edges e±,B2 G £Rx that share a 
vertex to G dR, B x {e\) = B x (e 2 ) or B' x {ei) = B' x (e 2 ). 

Note that the very last condition means that two clockwise adjacent edges 
have the same colour in both Bx and B'x unless one of the edges is ex- 

We let Ex = {{vx,u} | {vx,u} G -EV and u G denote the set of 
edges between vx and a vertex in Rx- 

For a vertex-boundary pair X (or edge-boundary pair X), a coupling \J r 
of 7Tg^ and ttq, x (or 7r^ x and ^b' x ) and t> G we define the indicator 
random variable 

t I 1, {C,C) is a pair of colourings drawn from \£ and C(u) 7^ C'(v); 

^ = \o, C(«)=C'(«). 

For an edge-boundary pair X, we define ^> x to be some coupling ^ of 
7i£ x and 7Tg/ x minimising E[l$ ; „ x ]. We define = E[l^ Xjt , x ]. For every 

pair (c, c') G [9] x [9] of colours, we let px(c, d) be the probability that, when 
a pair (C,C) of colourings is drawn from ^x, C(vx) = c and C'(vx) = d . 
Note that 

V(X)=E[U X , VX ]= PxM. 

c,c'e[9] 

c^c' 

5 Recursive coupling 

In order to show strong spatial mixing for q = 9 colours, we show that for all 
vertex-boundary pairs X and subregions R' C Rx, there exists a coupling 
* °f ^Sx an< ^ ^B'x sucn that 

decreases exponentially in the distance between wx and R' . As we will 
see, this implies strong spatial mixing for q = 9 colours. In order to show 
exponential decay, it will be convenient to work with edge-boundary pairs. 
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We closely follow the approach taken by Goldberg, Martin and Paterson 
in [9] and define a tree Tx associated with each edge-boundary pair X. 
The tree Tx is constructed as follows (Figure [3] illustrates an example of 
a tree Tx)- Start with a node r which will be the root of Tx- For every 
pair (c, c') G [9] x [9] of colours such that c ^ c', add an edge labelled 
(px(c,c'),vx) from r to a new node r cc /. If Ex is empty, r C)C / is a leaf. 
Otherwise, let ei, . . . , e& be the edges in Ex such that ex -< ei -< • • • -< e&. 
For each iG {1,..., let -Xj(c, c') be the edge-boundary pair consisting of 

• the region R x ,( c ,d) = Rx\ {vx}, 

• the distinguished boundary edge e Xi ( c ,d) = {wx t {c,d)^x x (c,d)} = e», 
where u>x i (c,c') = and 

• the pair (B Xl ( c ,d), B' Xl ( c ,d)) of partial 9-colourings of SR Xi ( c ,d) such 
that 

- B Xl ( c ,c'){e) = B x (e) for e G £R Xi (c,d) ^SR X , 

- B Xl ( c ,c'){e) = c' for e £ {ei, . . . ,ej_i}, 

- B Xi (c,d){e) = c for e 6 {e i5 . . -,e k }, and 

- B'x^ccf) ls identical to B Xi ( c ,d) on au edges but for which 
B'x t (c,c')(ei) = c'. 

Note that the properties of Xj(c, c') meet all the requirements for being 
an edge-boundary pair; the vertex h>jjq(c,c') has at most five neighbours in 
RxAc,d) (since wx ^ Rx l (c,c l ) is a neighbour of Wx i (c,c'))> an d any two clock- 
wise adjacent edges in ERx^d) that share a vertex in dRxA c ,d) have the 
same colour in B Xl ( c ,d) or B' Xi (c,d) (or both). 

Recursively construct TW C C /), the tree corresponding to edge-boundary 
pair Aj(c, c'). Add an edge with label (1, ■) from r CjC / to the root of T Xi ( c .d)- 
That completes the construction of Tx- 

We say that an edge e of Tx is degenerate if the second component of its 
label is For edges e and e' of Tx, we write e — > e' to denote the fact that 
e is an ancestor of e'. That is, either e = e', or e is a proper ancestor of e'. 
Define the level of edge e to be the number of non-degenerate edges on the 
path from the root down to, and including, e. Suppose that e is an edge of 
Tx with label (p,v). We say that the weight w(e) of edge e is p. Also the 
name n(e) of edge e is v. The likelihood 1(e) of e is He' e'-j-e w ( e )- The cosi 
7(t>,Tx) of a vertex u G i? x in T x is ^ e:n (e)=« ^( e )- 

Lemma 3 (Lemma 12 of Goldberg et al. [U]). For every edge-boundary pair 
X there exists a coupling ^ of 7r^ and ifi B , such that, for all v G 
E[l*,„] ^j(v,T x ). 

In the proof, given by Goldberg, Martin and Paterson in [9], the cou- 
pling is constructed recursively in the same manner as the tree Tx- The 
discrepancy at a given boundary vertex is broken to discrepancies at single 
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c,c')) = c ' 



L X 2 (c,c') 



L X 3 (c,c>) 



Figure 3: An example of the tree Tx- 
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boundary edges, so at every stage of the recursion, only pairs of colourings 
with a discrepancy at a sinle edge have to be considered (i.e., edge-boundary 
pairs). 

For an edge-boundary pair X and d ^ 1, we let Ed(X) denote the set of 
level-d edges in Tx- We define Td(X) = ^2 ee E d (x) ^( e )- 

Lemma 4. For every edge-boundary pair X and R C Rx there exists a 
coupling ^ of an< ^ ^B' x suc ^ ^at 

v&R d>d Rx {w x ,R) 

Proof. By Lemma there is a coupling ^ such that 

£E[1* )0 ] < ^ e E *(<o < E E ^) 

v£R veR v&R e:n(e)=v d^d Rx (w x ,R) e£E d (X) 

= E r *w- D 

d^d Rx (w x ,R) 



The following recursive definition of Fd(X) is equivalent to the definition 
above and will be useful in subsequent sections. 



E Vx{c,c) 



d = l; 



c,c'e[i 

c^c' 



Px(c,</)J2 T d-i(Xi(c,c/)), d>l 



1=1 



Note that Ti(X) = n{X). For a set U of edge-boundary pairs, we define 
T d (U) = m&xxeu T d {X). We define r d (0) = for all d. 



6 Exponential decay 

Our first step towards a proof of exponential decay and strong spatial mixing 
is to show that for any edge-boundary pair X, T^(X) decreases exponentially 
with d. A key ingredient in the proof is the quantity fi(X), for which we 
want to derive sufficiently good upper bounds. This is where we start. 

We use the lemma below by Goldberg, Martin and Paterson [9^. The 
idea is to shrink the region Rx so that vx is kept within the smaller region, 

1 In [5], Goldberg, Martin and Paterson define v{X) = E[l* Xjt , x ], which is the def- 
inition of fi(X) in this article. They give an alterative definition of /J.(X), however, as 
pointed out in the proof of Lemma 13 in [S], their fJ,(X) = v(X) indeed. 



12 



Ml = 


68809973/310505657 


Ml4 


= 25/91 


M27 


= 21/73 


M2 = 


11623551/51797443 


Mis 


= 9334/40215 


M28 


= 2833/11551 


M3 = 


456459/2005687 


Ml6 


= 11332/46633 


M29 


= 2833/11551 


M4 = 


408609/1601573 


Ml7 


= 11332/46633 


M30 


= 620/2321 


M5 = 


33/127 


Ml8 


= 7067/29188 


M31 


= 620/2321 


M6 = 


18199/78779 


Ml9 


= 11332/46633 


M32 


= 688/2389 


M7 = 


75312/325193 


M20 


= 775/2941 


M33 


= 5/17 


M8 = 


14165/58613 


M21 


= 4248/16015 


M34 


= 4/13 


M9 = 


70661/293514 


M22 


= 21/73 


M35 


= 4/13 


Mio = 


= 31648/123341 


M23 


= 5/17 


M36 


= 4/13 


Mil = 


= 2655/10063 


M24 


= 5/17 


M37 


= 5/17 


Ml2 = 


= 521/1853 


M25 


= 5/17 


M38 


= 4/13 


Ml3 = 


= 208/757 


M26 


= 32/113 


M39 


= 1/3 



Table 1: /ij is fi(X) maximised over all edge-boundary pairs X whose region 
Rx is the region Mj in Figure [H 



and use this smaller region to construct a new edge-boundary pair X' whose 
boundary colourings are identical to the boundary colourings of X on over- 
lapping boundary edges. The colours of the boundary edges introduced by 
shrinking Rx are chosen to maximise fi(X'). Then [J>(X) ^ fj,(X'). 

Lemma 5 (Lemma 13 of Goldberg et al. [9]). Suppose that X is an edge- 
boundary pair. Let R be any subset of Rx which includes vx ■ Let A be 
the set of edge-boundary pairs X' such that Rx 1 = R, ex 1 = &x, and, for 
e G £R X n£R x >, B x >(e) = B x (e) and B' x >{e) = B' x (e). Then fi(X) < 
max X / eA /i(X'). 

In this article, the 39 regions Mj illustrated in Figure [Hare of particular 
importance to us. In Section [8.21 we discuss why we chose these regions. In 
order to upper-bound fJ,(X) for an arbitrary edge-boundary pair X, we will 
shrink the region Rx down to one of the 39 regions and apply Lemma [SJ 
For i G [39] , we define the constants /ij according to Table Q] and prove the 
following lemma with the help of a computer. Details of the proof are given 
in Section [8[ 

Lemma 6. For i G [39], /J>(X) ^ Mi f or ever U edge-boundary pair X such 
that Rx is the region Mi in Figure [7] and vx and wx are the vertices labelled 
vm and wm> respectively. 

6.1 Regions and sets of edge-boundary pairs 

Let F be the region in Figure (5J where a vertex vf G F and wf G dF 
are labelled. Since F contains 12 vertices, we define F\, . . . , F2048 to be the 
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Part I of II 
Figure 4: The regions Mi, ... , M 39 . 
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Figure 5: The region F. 



Figure 6: The region G. 



2 = 2048 distinct subregions of F that all contain the vertex labelled vf- 
For i G [2048], we define Ui to be the set of edge-boundary pairs X such 
that the intersection of Rx and F is Fi, where vx and wx coincide with 
the vertices labelled vp and wf, respectively. Thus, any edge-boundary pair 
belongs to at least one of the sets U. 

Let G be the region in Figure El where a vertex vq G G and wq G dG 
are labelled. We define Q to be the set of subregions G' of G such that G' 
contains vq and at least one neighbour of wq is not in G' . 

We define a function $ : Q -> [39] x {0, . . . ,2048} 6 . Suppose G' £ G 
is a region. Then $(G") = (m, 6q> • • • j ^5) where m, bo, ... , 65 are uniquely 
specified as follows. Let M. C {Mi, . . . , M39} be the set of regions Mj such 
that Mj is a subregion of G' (where vq and wq coincide with vm and wm, 
respectively). M m is a region in M. such that \i m $C //j for all Mi & M.. 
When well defined, Fb , . . . , Ff, 5 are the intersections of G' and F taken 
according to Figure [Taj. . . J7fl respectively. If F^ is not well defined then we 
set hi = 0. For example, if the vertex above vq in region G (Figure ED is not 
in G' then none of the regions F\ , . . . , -F2048 is the intersection of G' and F 
in Figure ITdl hence 63 = 0. 

6.2 Upper bounds 

We use a computer to prove the next lemma. 

Lemma 7. There exist constants ai,...,«2048 G [1.1,5.0] suc/t that, for 
every region G' G Q, 

fj, m (a bl , . . . ,a b5 ) < a 6o (l - e) , (2) 

where (m, bo, . . . , 65) = 3>(G'), ao = 0, e = 1/1000 and i/ie values fx m are 
from Table{Jl 

The constants a» in the lemma above have been obtained in the fol- 
lowing way. We have written a computer program which goes through the 
regions G' of Q, calculates the values m,bo, ■ ■ ■ ,65 and adds Equation ([2]) 
above to a linear program. Once all regions in Q have been processed, the 
linear program contains thousands of inequalities. We then use a linear pro- 
gram solver to successfully find a satisfying solution such that each value 
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on is in the interval [1.1,5.0]. See Section [5] for more information on the 
computational part. 

Lemma 8. There exist constants ai, •••,02048 £ [1.1,5.0] such that, for 
i G [2048] and d^l, Y d (Ui) < a;(l - e) d , where e = 1/1000. 

Proof. Let oc\, . . . ,«2048 £ [1.1,5.0] be constants satisfying the inequalities 
in Lemma [3 For i G [2048], suppose X G C/j. We use induction on d to 
show that ^ ^(1 - e) d . 

For the base case d = 1, Ti(X) = < 1 < 1.1(1 - e) < Oi(l - e). 

For the inductive step d > 1, let G" be the intersection of Rx and G 
such that vx and tox coincide with vq and i«g, respectively. Hence G' G 

Let (m, 6q, . . . , 65) = $((?') and note that 60 = Since M m is a subregion 
of it follows from Lemmas [5] and [6] that fi(X) ^ fj, m . 

Recall that Ex is the set of edges between vx and a vertex in Rx- For 
r £ {1, . . . , \E X \} and (c, c') G [9] x [9] such that c ^ c' , let X r (c, c') be the 
edge-boundary pairs defined in the construction of the tree Tx in Section [5j 
From the definition of <£, the intersection of Rx r (c,d) an d F, taken such that 
v x r (c,c') an d w Xr ( c ,c') coincide with vp and wp, respectively, is Ff, k , where 
the value of k depends on the edge e r G Ex- Thus, with Uo = 0, 

\E X \ 5 

J2r d -i(x r ( c ,c')) ^ ^r^^j. (3) 

r=l k=l 
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We have 

\Ex\ 

T d (X) = Y, Px(c, c') Y, r d -i(X r (c, c')) (Definition of T d (X)) 

c,c'e[9] r=l 
5 

< Px^c^^d-iiUbJ (Equation©) 

c,c'e[9] k=l 
c^c' 

5 

= fi(X) Y T d-i(U bk ) (Definition of n(X)) 

k=l 

5 

^ l-i(X) y~]oife fc (l — e) d_1 (Induction hypothesis) 

fc=i 

< Mm(a6i + • • • + a fe5 )(l - e) d ~ l (/j,(X) ^ fi m ) 
^ Oj(l — e) d . (Lemma [7| and &o = i) 

□ 

The next corollary follows immediately from Lemma El 

Corollary 9. For every edge-boundary pair X and d ^ 1, r^(X) ^ 5(1— e) d ; 
w/iere e = 1/1000. 

Lemma 10. For every edge-boundary pair X and subregion R C Rx there 
exists a coupling of and 7r^, such that 

where s = 1/1000. 

Proof. Follows from Lemma 0] and Corollary [9j □ 

6.3 Vertex-boundary pairs and strong spatial mixing 

Similarly to Lemma fTUl we prove the following lemma for vertex-boundary 
pairs. 

Lemma 11. For every vertex-boundary pair X and subregion R C Rx there 
exists a coupling ^ of and ^%t x such that 

50 (w x ,R) 



v<=R 

where e = 1/1000. 
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Figure 8: A vertex-boundary pair X broken into three edge-boundary pairs 
Xi, X2 and X3. The numbers represent the colours of the boundary vertices 
and boundary edges, respectively. 



Proof. Let X be any vertex-boundary pair and let R C Rx. First suppose 
that wx has a neighbour y ^ Rx- Let E = {ei,...,e&} C be the 

boundary edges incident to wx such that {wx,y} -< ei -<•••-< e^. For 
i G [&], let Xj be the edge-boundary pair consisting of 

• the region Rx t = Rx, 

• the distinguished boundary edge exi = 

• the pair (Bxi, B' Xi ) of partial 9-colourings of £Rxi such that 

- B Xi ({w,v}) = B' Xi ({w,v}) = B x (w) for {w,v} G £R X \E, 
where w G 

- £ Xi (ei) = S^fo) = B'xK) for j G {1, ... ,i - 1}, 

- B Xl (ei) = B x (w x ) and B' Xi (ei) = B' x (w x ), and 

- 5 Xi (ej) = B'xtfe) = B x {w x ) far j € {» + 1, . . . , fc}. 

Note that 7r| x = 71"^ and tt^, x = ■ Figure [8] illustrates an example of 
how the edge-boundary pairs Xj are constructed. 

We use Lemma [10] and for i G [A;] we let ^ be a coupling of 7r^ and 

tt 9 b , such that 

Xi 

j2ni^, v ]^^-^ dRx(wx ' R) ■ (4) 

We define a coupling \F of tt^ x and 7ig, by composing the couplings 
^1, . . . , vpfc as follows. Let (Co,Ci), (Ci,C2), . . . , (Cfc_i,Cfc) be pairs of colour- 
ings drawn from . . . , ^fc, respectively. Then (Co, is the pair of colour- 
ings drawn from \t. If, for v G R, Cq(v) ^ Ck(v), then Cj-i(v) / Cj(^) for 
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some j G [k] . Hence 

k 

i=i 

From Equations ([!]) and (0), and A: being at most 5, we have 

X e[i»,„] < E E = E E < t (1 " £ ) dHx( " x,ii) • 

vER v£R i=l i=l v€R 

Lastly, suppose that all neighbours of wx are in Rx- A technical detail 
arises here because we can no longer break the discrepancy at wx into edge- 
boundary pairs Xi as above. Instead we will first randomly choose a colour 
of a neighbour u of vox and then define edge-boundary pairs for the region 
R x \ {u}. 

Let u be any neighbour of wx and let R u = Rx \ {u}. We define a 
coupling \£ of 7r| x and follows. Let C and C be colourings drawn 

(independently) from ttq and 7Tg, , respectively. Let B and B' be the two 
colourings of dR u such that = Bx(w) and = B'x{w) for u; G 

<9f?x n dRu, and #(u) = C(u) and = C'(u). Let be a coupling of 

7Tg and 7Tg, which we will define shortly. In a pair of colourings drawn from 
^, the vertex u is assigned the colours C(u) and C'(u), respectively, and 
the other vertices of R u are assigned colours according to ^f'. It remains to 
define 

Let E = {ei, . . . , e^} C £R U be the boundary edges incident to either 
wx or u such that e\ is incident to wj, and {wx,u} -< ei -<•••-< eg and 
-< e6 -< • • • -< efc. Note that five neighbours of wx are in R u and 
both e6 and are edges between u and neighbours of wx. An example is 
illustrated in Figure El 

Similarly to above, we define k edge-boundary pairs. For i G [k], let Xi be 
the edge-boundary pair consisting of the region Rx { = R U i the distinguished 
edge exi = and boundary colourings that differ on ex 4 - Note that 7r| = 
7Tn and n%, = tt n/ . See Figure [9] for an example. 

x 1 D x k 

Let be a coupling of Tr% x and 7I"b/ x such that 

J]E[1« (| ,] s$ £(i_ e )^«x(«'x.ii)-i . 

The —1 in the exponent comes from the fact that for some Xi, the edge exi 
is incident to u and cIr x (u, R) might be dj} x (wx, R) — 1 for some R. 

The coupling is defined by composing the couplings Vl/i, . . . , Thus, 
with being at most 10, 

E em < E E E W = E E < f a - e)**^- 1 . 

veR v£R i=l i=l -yG-R 

□ 
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Figure 9: An example of edge boundary colourings of the edge-boundary 
pairs X±, . . . , X\q constructed from a vertex-boundary pair X for which all 
six neighbours of wx are in Rx- Here we suppose that Bx{wx) = 1 and 
B' x (w x ) = 2, and C{u) = 3 and C'(u) = 4. 

The main theorem of the paper can now be proved. 

Theorem 1. There is strong spatial mixing for 9 colours on the triangular 
lattice. 

Proof. Let R by any region, R' C R, w € dR, and B and B' two partial 
9-colourings of dR that are identical on all vertices except for w. Suppose 
that $ is a coupling of 7r| and ir% and let (C,C) be a pair of colourings 
drawn from Then 

d tv (ir B (R'),ir B ,(R')) < Pr(C / C on R 1 ) < E[l*,„] . (6) 

Let X be the vertex-boundary pair such that Rx = R, wx = w , &x = B 
and B'x = B'. Let e = 1/1000, /3 = and /3' = - ln(l - e). Note that a 

coupling of 7r| and 7Tg, is also a coupling of 7r| and 7Tg,. Using Lemma [IT] 
we know there is a coupling of ir Bx and 7Tg/ such that 

E[l*',«] < p e -P**x<t°°m . (7) 
Let ^ = Strong spatial mixing follows from Equations ([6]) and ([7]). □ 

7 Rapidly mixing Glauber dynamics 

We use Theorem 8 of Goldberg, Martin and Paterson |9J to show that the 
Glauber dynamics is rapidly mixing. Before applying their theorem we must 
introduce some notation. 
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Let BalLj(u) denote the set of vertices in Vqr that are at distance at 
most d from the vertex v € Vf. Thus we have Ballo(w) = {v}. From the 
definition of the triangular lattice it follows that |9BalLj(u)|, the number 
of vertices at distance d + 1 from v, is of order 0(d), hence |BalLj(u)| E 
@(d 2 ). One can show that |c?BalLj(«)| = 6(d + 1) but we do not need to 
be that precise here (see Lemma 2.22 in |10| for a proof). Since, for all v, 
|3Ball^(v)|/|Ball ( i(t | )| — > as d — > oo, the triangular lattice is neighbourhood- 
amenable (see Definition 2 in [9]). 

From Lemma [IT] it follows that for every vertex-boundary pair X there 
is a coupling ^ of 7r| x and ^%i x such that 

50 A 

£ E[1 *^ ] ^ 1/1000.(1-1/1000) = 7 ' (8) 

v&Rx 

where A = 6 is the maximum degree of the triangular lattice and e is the 
appropriate constant. Thus, according to Definition 4 of Goldberg, Martin 
and Paterson [9], the triangular lattice has a so called e-coupling cover. 
We can now use Theorem 8 in [9] and state our result. 

Theorem 2. The Glauber dynamics on 9-colourings of a region R of the 
triangular lattice, with partial 9- colouring B of dR, has mixing time Tg(J) 6 
0(n 2 + ralog k), where n = \R\. 



8 Computations 

In this section we go through the computational steps involved in obtaining 
the 39 //-values in Table [1] and proving Lemma [71 The computations are 
rather demanding and took around two weeks to finish on a fairly powerful 
home PC of year 2006. 

The implementation of the programs, written in C, can be found at 
http : //www . cs . bris . ac . uk/home/markus/papers/triangular/ 

8.1 Computing the /i- values in Tabled] 

Suppose that X is an edge-boundary pair. Let Q be the set of colourings 
C € such that C(vx) = Bx(ex), and let fi' be the set of colourings 

C € fl 9 Bx such that C(vx) = B' x {ex)- Let Oboth = n Q 9 B , x be the set 
of proper 9-colourings of Rx that agree with both Bx and B'x- 

Lemma 12. For any edge-boundary pair X, 

max{|»|,|»'|} 

Ml ' \n hoth \ + max{\n\,\n'\}- 
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Figure 10: An edge-boundary pair X with region Rx = M\ and boundary 
edges coloured c\ , . . . , cig . 

Proof. Recall from the definition of fi(X) that n(X) = K[l^ XtVx ]. 

Suppose first that ^ In order to minimise E[l^.^ x ] we construct 
a coupling ^ of ttb x and ttb'x as follows. Let (C,C) be a pair of colourings 
drawn from If C'(vx) = Bxie-x) then C(vx) ^ C'(vx) because C is drawn 
from ttb x , preventing vx from receiving the colour Bx(ex)- However, if 
C'{vx) 7^ Bx(ex) then we choose C(vx) = C'(vx), which is always possible 
under the assumption that |0| ^ |0'|. Hence E[l$ x>1)x ] = |fi|/(|fibothl + l^l)- 

Suppose second that |fi'| ^ By symmetry we have that E[l# x Vx ] = 

\n'\/(\n hoth \ + |o'|). □ 

For each of the 39 regions Mj in Figured] we have written a program in C 
which computes /ij. We use the region Mi to illustrate how /xi is obtained. 
The other [i- values are computed similarly. 

Let X be an edge-boundary pair such that Rx = Mi, t>x = = 
to a/ , -Bx and B'x assign the colours c± , . . . , cig to the boundary edges in 
\ {ex}, B x (e x ) = 1 and B' x {e x ) = 2. See FigureQjH 

For i € [9], let rij be the number of proper 9-colourings of i?x that agree 
with the colouring ci,...,cis of the boundary, disregarding the colour of 
the edge ex, and assign the colour i to vx- Thus, n\ = ri2 = and 
^3 + • • • + ng = |Oboth|- We write a subroutine that computes the values rii 
given the colours c\ , . . . , cig . Computing them in a brute force manner will 
take too long so we must be a little more clever than that. We construct 
a dynamic programming table which lets us reuse the number of colourings 
computed for subsets of the region Rx- For details, see the code on the 
webpage given above. 

Let m = n\/{ni + + ■ ■ ■ + ng) be a function of c%, . . . , cig. We loop 
through the colours ci, . . . , cig, and for each configuration we compute m. 
It follows that /ii is the largest value of m that we ecounter. 

Each colour Cj can take a value from the set {0,1,..., 9}. Looping 
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through all 10 configurations of C\,...,cis yields an unnecessary large 
number of redundant boundary colourings. Instead of considering all 10 18 
configurations, we keep the number down by making a few useful observa- 
tions: 

• Swapping the colours c\q and cyj will not change the value m. Hence 
we may skip colourings for which cig > cyj. 

• The colours 0, 1 and 2 have a special meaning here since symbolises 
"no colour" and 1 and 2 are used on the edge ex- However, the 
colours 3, . . . , 9 are merely labels and therefore there is no reason to 
use a colour d G {4, . . . , 9} for Cj unless the colour d — 1 has been used 
for some Cj/, where j' < j. 

• From the definition of an edge-boundary pair, ci,cis € {1,2} and 

Ci ^ Ci 8 . 

Obtaining all 39 ^-values by using tricks like those above, took around 
two weeks on a fairly powerful home PC of year 2006. More precisely, we 
left a computer running non-stop for 24 hours per day without using it for 
other purposes. Note that we have implemented our programs with fast and 
optimised C code. 

8.2 The experimental phase 

A very reasonable question to ask is why we used exactly those 39 regions Mi 
in Figure H] and the regions G and F. The regions are the result of a rather 
long experimental phase where we started with a set of smaller regions and 
carried out the computations as described in this article. Initially, we failed 
to prove Lemma [71 That is, we were not able to find constants ai such that 
the equation in Lemma [7] would hold for all regions in Q. The reason for 
this is twofold: too small sizes of G and F do not allow enough recursions, 
and too small regions M, yield too large values /ij. Gradually we increased 
the sizes of the regions until Lemma [7J could successfully be proved. 

While increasing the sizes of G and F, we also computed /u-values for 
growing regions Mj. In total we considered a few hundred distinct regions 
Mi , of which some were even larger than the region M\ in Figure HI One 
might ask how we managed to compute the \i- values for such a vast number 
of regions given that it took two weeks of computations for the 39 regions in 
Figure HI Instead of computing the fi- values exactly, we used a hill climbing 
technique where we randomised colourings of the boundary, to which we 
iteratively made small changes, whereby larger and larger values m (see 
previous section) were found. This process allowed us to build a "library" 
of randomised /i-values. In practice, we let a computer run during the night 
over some time in order to obtain hopefully good estimates of the ^-values. 
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Interestingly, it turned out that for many regions, only a few minutes running 
time was enough to yield a value of m that did not seem to increase further. 
For such regions we stopped the hill climbing process after a couple of hours. 

Once Lemma [7] had been successfully proved with randomised //-value 
estimates, we were faced with the task of computing the exact /i-values. 
Initially the set of M-regions in the proof was rather large, so first we pruned 
the set by carefully choosing regions to throw away. Eventually we ended 
up with the 39 regions in Figure for which the //-values were computed 
exactly. It is interesting to note that the 39 randomised //-value estimates 
were identical to the exact values. 

The successful use of the //-value estimates suggests that our approach 
could be used to prove better mixing bounds for other lattices. Although the 
system of inequalities that was solved in the proof of Lemma [7] contained a 
huge number of inequalities (around 100,000 after throwing away redundant 
inequalities), the real bottleneck seemed to be the demanding computations 
of the //-values. 
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